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Abstract 

Mitogen-activated protein kinase (IVIAPK) cascades have important functions in plant growth, development, and response to 
various stresses. The MAPKK and IVIAPKKK gene families in tomato have never been systematically analyzed. In this study, 
we performed a genome-wide analysis of the MAPKK and MAPKKK gene families in tomato and identified 5 MAPKK genes 
and 89 MAPKKK genes. Phylogenetic analyses of the MAPKK and MAPKKK gene families showed that all the MAPKK genes 
formed four groups (groups A, B, C, and D), whereas all the MAPKKK genes were classified into three subfamilies, namely, 
MEKK, RAF, and ZIK. Evolutionary analysis showed that whole genome or chromosomal segment duplications were the main 
factors responsible for the expansion of the MAPKK and MAPKKK gene families in tomato. Quantitative real-time RT-PCR 
analysis showed that the majority of MAPKK and MAPKKK genes were expressed in all tested organs with considerable 
differences in transcript levels indicating that they might be constitutively expressed. However, the expression level of most 
of these genes changed significantly under heat, cold, drought, salt, and Pseudomonas syringae treatment. Furthermore, 
their expression levels exhibited significant changes in response to salicylic acid and indole-3-acetic acid treatment, 
implying that these genes might have important roles in the plant hormone network. Our comparative analysis of the 
MAPKK and MAPKKK families would improve our understanding of the evolution and functional characterization of MAPK 
cascades in tomato. 
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Introduction 

Mitogen-activated protein kinase (MAPK) cascades, whicli are 
widely distributed in eukaryotes, have an important function in the 
diverse developmental and physiological processes of plants, and in 
response to various biotic and abiotic stresses [1,2]. Each MAPK 
cascade consists of three protein kinases: MAPKs, MAPK kinases 
(MAPKKs/MKKs), and MAPKK kinases (MAPKKKs/ 
MEKKs). MAPKKK activates MAPKK through the phosphor- 
ylation of serine and serine/threonine residues in its T-loop. 
Similarly, MAPKK activates MAPK through the phosphorylation 
of tyrosine and threonine residues in the TEY or TDY motif 
between kinase subdomains VII and VIII. Plant MAPK cascade 
genes were first reported in Arabidopsis. Up to now, MAPK 
cascade family genes have been identified in many other plant 
species, including poplar, rice, soybean, maize, tomato and N. 
benthamiana [3-12]. 

In plants, the number of MAPKK family genes varies markedly 
across species. The estimated number is 10 in Arabidopsis, eight in 
rice, 1 1 in soybean, 1 1 in poplar and 1 2 in Bmchypodium 



distachyon. According to phylogenetic analyses, all MAPKKs can 
be classified into four groups (groups A-D). The function of the 
group A MAPKK members, mdudmgAiMAPKKl , AtMAPKK2, 
and AIMAPKK6, in Arabidopsis has been detailed studied. 
AIMAPKK2 is shown to play an important role not only mediates 
innate immunity responses but also has an important function in 
the cold and salt signaling transduction pathway [13,14]. 
AtMAPKKl also has essential functions in pathogen defence 
and have functional redundancy with AIMAPKK2 [13,15]. 
AIMAPKK6 acts in upstream oi AIMAPK13 in yeast cells [16]. 
Functional data on MAPKK family members belonging to the 
other three subgroups are limited. Only one AtMAPKK3 
belonging to group B was proved to function in pathogen 
resistance and participate in jasmonate signal transduction 
pathway in Arabidopsis [17,18]. 

MAPKKK is a large gene family containing 80, 75, 74 and 150 
members in Ar'abidopsis , rice, maize and soybean, respectively 
[7,19-21]. All MAPKKK genes have been divided into three 
major groups, namely, RAF, MEKK, and ZIK. The RAF 
subfamily in rice, maize, and Arabidopsis has 48, 43, and 46 
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members, respectively [20-22]. This subfamily contains a 
conser\'ed catalytic and RAF-specific signature GTXX (W/Y) 
MAPE [22]. Plant MEKK-like MAPKKK genes also harbor a 
conserved catalytic domain and conserved signature G (T/S) PX 
(F/Y/W) MAPEV, similar to animal MEKKs. The ZIK subfamUy 
contains a conserved signature GTPEFMAPE (L/V/M) (Y/F/L) 
across these members. Functional data on MEKK-like genes are 
more readily available than that on the other two subfamilies. In 
Amhidnpsis, three MAPKKK genes named ANPl, ANP2, and 
ANP3 have an important function in the signal transduction 
pathways that control plant cell division [23,24]. Another 
Arabidopsis MEKK-like gene (MEKKl) is involved in stress 
response and participates in signal transduction in diverse 
development process [25,26]. A MAPKKK gene, YODA regulates 
stomatal development in Arabidopsis [27]. The functional 
characteristics of MEKK-like genes from other species, such as 
tobacco and Solanum chacoense have been studied in depth 
[28,29]. In tomato, SlMAPKKKa positively regulates cell death 
associated with both plant immunity and disease resistance [30]. 
SlMAPKKKe is involved in signaling networks associated with 
plant immunity [31]. The functions of some RAF subfamily 
members have also been investigated in Arabidopsis and other 
plant species [3]. CTRl and EDRl, belonging to the Arabidopsis 
RAF MAPKKK subfamily, negatively regulates ethylene signaling 
transduction and participates in pathogen resistance [32,33]. 
MAP3K84, an Arabidopsis Raf-like MAP3K, has a function in the 
regulation of plant growth and shoot branching [34]. A rice RAF- 
like MAPKKK named ILAl regulates mechanical tissue forma- 
tion [35]. DSMl is involved in rice drought resistance [36]. 
Function analysis of ZIK-like genes is limited. However, most rice 
ZIK-like MAPKKK genes can be upregulated by at least one 
abiotic stress [20], indicating that they might be involved in stress 
signaling transduction pathways. 

Up to now, only four MAPKK genes {SlMAPKKl-4) [37] and 
three MAPKKK genes [MAPKKKa, MAPKKKe, and NPKl) 
[31,38,39] in tomato have been identified. The three gene families 
that involved in MAPK cascade have never been systematically 
investigated in tomato except in our recent report on the MAPK 
gene family [11]. Taking advantage of the a\ ailable tomato 
genome database, we performed a genome-wide search for the 
homologues of the MAPKK and MAPKKK families in tomato. 
Detailed information on the genomic structures, chromosomal 
locations, and sequence homologies of these genes is presented in 
this paper. In addition, the phylogenetic relationships of these gene 
families in Arabidopsis, tomato, rice, and maize were compared. 
Finally, the expression profiles of SIMAPKK and SIMAPKKK 
genes during development and in response to various biotic and 
abiotic stress treatments were investigated through quantitative 
real-time reverse transcription PGR (qRT-PCR) analyses. 

Materials and Methods 

Searching for MAPKK and MAPKKK family genes 

Predicted tomato peptide sequences were downloaded from the 
SGN database (http:/ /solgenomics.net/organism/Solanum_ 
lycopersicum/genome) to construct a local protein database. To 
identify tomato MAPKK, this database was searched using all 
known plant MAPKK protein sequences, including 10 At- 
MAPKKs, 8 OsMAPKKs, 1 1 GmMAPKKs and 1 1 PtMAPKKs, 
as query sequences downloaded from NCBI (http://ncbi.nlm.nih. 
gov), TAIR (http://www.arabidopsis.org), and a rice genome 
database (http:/ /ftp. ncbi.nih.gov/genbank/genomes/Eukaryotes/ 
plants/Oryza sativa/). Similarly, for the tomato MAPKKK gene 
famHy, 80 AtMAPKKK, 75 OsMAPKKK, and 74 ZmMAPKKK 



protein sequences from Arabidopsis, rice, and maize were used as 
query sequences to search against the tomato genome database 
[7,20,21]. The search was carried out using BLASTP, and 50% 
identity was used as the threshold for the sequences obtained from 
BLAST analysis. Self BLAST of the sequences was carried out to 
remove redundancies. The putative functional domains of all the 
sequences were detected by BLASTP of NGBI (http://blast.ncbi. 
nlm.nih.gov), and identified using the Pfam program under a default 
E-value level (0.01) (http://www.Pfam.sanger.ac.uk/) and the 
SMART database (http://smart.embl-heidelberg.de/). Sequences 
without known conser\'ed domains of the MAPKK or MAPKKK 
gene families were excluded from further analysis. Finally, 
predictions of MAPKK and MAPKKK sequences were further 
verified with gene structure, EST, and unigene analyses. Further- 
more, the full-length cDNA sequences of predicted MAPKKs and 
AL^PKKKs in tomato were identified by BLASTN against the 
Kazusa FuU-length Tomato cDNA Database (http://www.pgb. 
kazusa.or.jp/kaftom/blast.html). The isoelectric point (pi) of the 
MAPKK and MAPKKK proteins was predicted using Compute 
pI/Mw software (http://www.expa sych/tools/pitool.html). Sub- 
cellular localization prediction of each of these family gcn(s was 
carried out using the CELLO v2.5 server (http://cello.life.nctu.edu. 
tw/) [40]. 

Multiple sequence alignnnent and phylogenetic analysis 

Multiple sequence alignment for aU the MAPKKs and 
MAPKKKs in Arabidopsis, rice, maize, and tomato was generated 
using ClustalX vl.81 [41]. PlantsP (http://plantsp. genomics. 
purdue.edu/index.html) was used to scan the motifs and domains 
of these kinase protein sequences. Phylogenetic analysis was 
performed using MEGA 4.1 program by the neighbor-joining (NJ) 
method [42], and a bootstrap test was carried out with 1000 
interactions based on the full-length protein sequences. 

Cis-element analysis of putative promoter regions of 
MAPKK and MAPKKK genes 

To investigate cis-elements in the promoter regions of MAPKK 
and MAPKKK genes, 2000 bp of the genomic DNA sequences 
upstream of the transcriptional start site of each MAPKK and 
MAPKKK gene were chosen. These sequences were used to 
search against the PLACE database (http://www.dna.affrc.go.jp/ 
PLACE/) to find the putative cis-regulatory elements. 

Mapping MAPKK and MAPKKK genes on chromosomes 
and gene duplications 

To determine the location of tomato genes on chromosomes, 
the nucleotide sequences of all these genes were further used as 
query sequences for BLASTN search against SGN Tomato Whole 
Genome Scaffold data (2.30) (http://www.sgn.comell.edu/tools/ 
blast/). Finally, the locations of these genes in tomato were 
detected. Synteny analysis of the SIMAPKK and SIMAPKKK 
genes was performed online using PGDD (http://chibba.agtec. 
uga.edu/duphcation/) [43]. Tandem duplications were defined as 
genes located within five loci of each other [44] . 

Plant materials, growth conditions, and treatments 

Tomato [S. lycopersicum L.) cv. Micro-Tom plants used for 
expression analysis from the Tomato Genetics Resource Center 
(University of California, Davis, USA) were grown in growth 
chambers at 26±1°C at 40% to 50% relative humidity with a 
photoperiod of 14 h light/ 10 h dark. Three-week-old seedlings 
with three fully opened leaves were used for all abiotic and biotic 
treatments. The leaves, stems, roots, flower buds (1 d before 
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flowering), and fi"uits (10 d after pollination) were collected fi'om 
flowering plants. All the samples were fi"ozen in liquid nitrogen 
immediately and stored at -75°C until RNA isolation. 

Heat and cold stresses were produced by incubating the 
seedlings at 37±1°C and 4±1°C for 2 h, respectively. Drought 
stress was initiated by withholding water supply to three-week-old 
seedlings after they were fuUy watered. Leaves were harvested after 
withholding water for 7 d when the leaves started to curl because 
of drought stress. Salt stress was produced by adding 200 mM 
sodium chloride to the planter box for 3 h. Control seedlings were 
grown at 26 ± 1 °C with normal irrigation. 

Biotic stress treatment was carried out using Pseudomonas 
syringae pv. tomato DC3000 cultivated in King's B medium. The 
cells were pelleted, resuspended, and diluted in 10 mM MgS04 
and 0.02% Silwet-77 to a concentration of 2x10' CFU ml"' to 
8xl0'^ CFU ml"'. The plants were spray-inoculated until leaf 
surfaces were uniformly wet. Meanwhile, the control seedlings 
were sprayed with ddHaO with 10 mM MgSO^ and 0.02% 
Silwet-77 without bacterial strains. After inoculation, the tomato 
plants were incubated at 26± 1°C in 60 % relative humidity with a 
1 4 h photoperiod for the duration of the experiment. The samples 
were collected 2 h after treatment. 

For hormone treatments, the seedling leaves were sprayed with 
100 mM indole-3-acetic acid (lAA) or 100 mM salicylic acid (SA), 
and sampled at 0, 1, 2, 4, 8, and 16 h intervals [45]. 

Each experiment was repeated three times, and 20 seedlings 
were used in each replication of each treatment. 

RNA extraction and qRT-PCR expression analysis 

The total RNA was extracted using TRIZOL reagent 
(Invitrogen, Germany) according to the manufacturer's instruc- 
tions. The first cDNA strand was generated using a Takara 
Reverse Transcription System (Japan) following the manufactur- 
er's protocol. A maximum of 1 |J.g of RNA was used for each 
reverse-transcription reaction, and a gDNA eraser in the kits was 
used to eliminate DNA to prevent DNA contamination. qRT- 
PCR techniques were employed to characterize the gene 
expression profiles of SlMAPKKs and SlMAPKKKs using die 
primer pairs designed by Applied Biosystems Primer Express 
software (Table SI). To ensure the specificity of each primer to its 
corresponding gene, the primers were submitted to the tomato 
genome database for BLAST search. AU non-specific primers that 
show more 50% percent sequence similarity to multiple regions 
were eliminated and redesigned to minimize potential non-specific 
PCR amplification. Thus, the results from real-time PCR analysis 
might represent the expression pattern of a specific MAPKK or 
MAPKKK gene. Real-time PCR analyses were carried out 
according to the description by Wu et al. [46]. Two biological 
and at least three technical replicates for each sample were 
obtained in the real-time PCR machine (BIO-RAD CFX96, 
USA). To normalize the total amount of cDNA in each reaction, 
the tomato SlUhiB (accession number X58253) gene was co- 
amplified as an endogenous control to calibrate relative expres- 
sion. The Ct method of relative gene quantification recommended 
by Applied Biosystems (PL Applied Biosystems, USA) was used to 
calculate the expression levels of different treatments. Student's t- 
test was used to determine the statistical significance of the 
differential expression patterns between treatments. A heatmap 
was generated by matrix2png using the relative expression data of 
each SIMAPKK and SIMAPKKK gene [47]. 
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Figure 1. Phylogenetic analysis (left), domain organization (middle) and exon-intron structures (right) of tomato MAPKKs. The 

unrooted phylogenetic tree was generated using IVIEGA4.1 program by the neighbor-joining method. Bootstrap supports from 1000 replicates are 
indicated at each branch. The gene names of each subfamily are indicated with the same color. The domain organizations are analyzed by scanning 
of the protein sequences for the presence of known motifs and domains using PlantsP. The exon-intron organization of corresponding SIMAPKK 
genes is represented by yellow boxes and lines, respectively. 
doi:10.1371/journal.pone.0103032.g001 



Results and Discussion 

Identification and sequence analysis of MAPKK and 
MAPKKK genes in tomato 

The published tomato genome database enables the genome- 
wide analysis of the MAPKK and MAPKKK gene famihes in 
tomato [48]. To fmd all the members of these two families, 
BLASTP searches against a local database built using protein 
sequences were performed using these sequences, which contained 
40 MAPKKs from four species, including Arahidopsis , rice, 
soybean, and Populus irichocarpa, and 155 MAPKKKs from 



Arabidopsis, rice, and maize. Only the members with above 50% 
identity were collected. Redundant sequences were removed 
manually. Thus, we found eight candidates for SlMAPKKs and 
103 candidates for SlMAPKKKs. The candidate sequences were 
further evaluated by identifying the putative functional domains of 
through NCBI BLASTP (http://blast.ncbi.nlm.nih.gov/). The 
sequences without the relevant domains or conserved motifs were 
removed. After multiple cycles of these analyses, we identified five 
SlMAPKKs and 89 SlMAPKKKs from the currently available 
tomato. We completed EST hits and a full-length cDNA search to 
verify their existence (Tables 1 and 2). The existence of all 
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Figure 2. Phylogenetic analysis (left), domain organization (middle) and exon-intron structures (right) of 89 SlMAPKKKs in tomato. 

For other details, see Figure 1. 

doi:1 0.1 371/journal.pone.01 03032.g002 
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Figure 3. Alignment of SIMAPKK proteins in tomato. The highlighted part shows the conserved signature motif obtained with the ClustalX 
program. 

doi:1 0.1 371 /journal.pone.01 03032.g003 



MAPKKs was supported by EST hits except SIMAPKK5, and 
two out of five MAPKKs were found in fuU-lengtli cDNA 
sequences. The existence of MAPKKK family genes was also 
supported by EST hits, whereas only 1 1 out of 89 SlMAPKKKs 
were found in full-length cDNA sequences. Given that no standard 
nomenclature is followed for MAPKKKs in plant species, we 
named them sequentially based on their distribution on chromo- 
somes [5,20]. 

The polypeptide lengths of the MAPKK genes ranged from 335 
aa to 515 aa, and their predicted molecular weights ranged from 
37.5 kD to 57.5 kD. The predicted pi has a range of 5.49 to 8.7. 
However, the polypeptide lengths of the MAPKKK genes ranged 
from 290 aa to 1618 aa, and their predicted molecular weights 
ranged from 23.8 kD to 180 kD. The theoretical pi has a range 
from 4.59 to 9.38 (Tables 1 and 2). 

The MAPKK genes were predicted to be localized in the 
cytoplasm, nucleus, and mitochondria. Similarly, most of the 



MAPKKK genes were predicted to be localized in the cytoplasm, 
nucleus, and mitochondria, and others were predicted to be 
localized in the plasma membrane and chloroplast (Tables 1 and 
2). 

Phylogenetic relationship, conserved domain, and gene 
structure analysis 

To further characterize the MAPKKs and MAPKKKs from 
tomato, the kinase domains of tomato were aligned using 
ClustalW and analyzed using MEGA4. Unrooted phylogenetic 
trees were generated from the alignment of the full-length protein 
sequences of all five SlMAPKKs and 89 SlMAPKKKs by the NJ 
and ME methods, and showed similar topologies with only minor 
modifications at deep nodes. Similar to those in Arahidopsis and 
rice [3,4], five MAPKK genes in tomato formed four groups 
(groups A-D) (Fig. 1). Consistent with previous reports on 
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Figure 4. Alignment of MEKK-like SIMAPKKK proteins obtained with the ClustalX program. The highlighted part shows the conserved 
signature motif. 
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Figure 5. Alignment of Raf-iil<e SiiVIAPKKK proteins obtained with the ClustaiX program. The highlighted part shows the conserved 
signature motif. 

doi:10.1371/journal.pone.0103032.g005 



Arabidopsis, rice, and maize [4,20,21], 89 SlMAPKKKs were 
divided into three categories, including 33 MEKK members, 16 
ZIK members, and 40 RAF members (Fig. 2). 

Conserved domain analysis showed a kinase domain in all the 
MAPKKs and MAPKKKs. In the SIMAPKKK family, most of 
the Raf family proteins contained a C-terminal kinase domain and 
long N-terminal regulatory domain except for SIMAPK82. By 
contrast, the majority of the ZIK members had an N-terminal 
kinase domain except for SIMAPK12. Protein tyrosine regions 
were distributed across different subfamily members. A ubiquitin- 
interaction motif and ACT domain functioning in the regulation 
of a wide range of metabolic enzyme activities were found only in 
the RAF subfamily (Fig. 2), which is consistent with the previous 
findings in rice and Arabidopsis [7,20]. 

Based on the predicted sequences, tomato MAPKK and 
MAPKKK gene structures were mapped. SlMAPKKl and 
SIMAPKK3 belonging to group A contained eight exons and 
seven introns, but MAPKK genes from groups C and D usually 
contained no intron (Fig. 1). The gene structures of MAPKKKs 
were highly divergent, even in the same subfamily. The intron- 
exon patterns of these genes showed no obvious similarity among 
the members even in the same group (Fig. 2). However, when 



comparing the intron-exon organization with phylogenetic anal- 
ysis of these genes, we found that there was a relatively good 
correlation between intron conservation and phylogenetic rela- 
tionships. Those genes clustering together on the phylogenetic 
trees often had similar intron-exon patterns (Figs. 2). For example, 
SIMAPKKK46, SIMAPKKK47, and SIMAPKKK84 clustered 
with very high bootstrap (100%) on the phylogenetic tree. 
Meanwhile, all of them contained only one intron (Figs. 1 and 2). 

Sequence alignments of conserved motifs 

All the identified MAPKK genes from Arabidopsis, rice, and 
poplar contain 11 catalytic subdomains [4,49]. MAPKKs were 
also featured by a putative K/R-K/R-K/R-X (l-6)-L-X-L/V/I 
domain as a docking region. This conserved motif could also be 
found in most tomato MAPKK proteins (Fig. 3). 

The plant MAPKKK gene family was formed by three 
subfamilies, each of which contains signature sequences or motifs 
different from those in the other two subfamilies [7] . In this study, 
a conserved motif G (T/S) PX (F/Y/WO MAPEV [20] was found 
in aU 33 putative MEKK genes except SlMAPKKKi, SIM 
APKKK33, and SIMAPKKK83, further confirming that they 
belonged to the MEKK subfamily (Fig. 4). The RAF family is the 



GTTEFMAPE(UV/M)CY/F/L) 



SlMAeKKKSj 

31MAeKKK7J ; 
SlHAPKKl 

SlMAeKKK7S : 

SlHAPKKKiy ; 

SlHAeKKHiy ; 

SIMAPKKK^^ ; 

31MAPRKH7 J ; 

SIHABKRKCJ ; 

SlHAPKKKb : 

31HAPKKH71 ; 
31MAPKR1 

SlMAPKKKi 1 ; 

SIMAPKKKTS : 

SIMAPKKKIH : 






SlMAPKKKi. ; 
S1MAPKKK73 ; 
S1MAPKKK07 ; 
S1MAPKKK7K ; 
SIMAPKKK'I 9 ; 
S1MAPKKK3S 
SlMAPKKKi 9 
S1MAPKKK42 
SIMAPKKKTD 
S1MAPKKK6C1 
S1MAPKKK6 
S1MAPKKK7 1 
al«APKKK27 
alMAPKKKIl 
S1MAPKKK76 
S1MAPKKK12 




Figure 6. Alignment of ZIK-iike SiiVIAPKKK proteins obtained with the ClustaiX program. The highlighted part shows the conserved 
signature motif. 
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Figure 7. Chromosomal distribution of SIMAPKKs and SIMAPKKKs genes in tomato genome. The names of each tandem duplicated gene 
cluster of the two families were indicated with black rectangles. The triangles indicate the upward or downward direction of transcription. 
doi:1 0.1 371 /journal.pone.01 03032.g007 



largest subfamily in tomato and other reported species with a 
conserved signature GTXX (W/Y) MAPE in its kinase domain 
across the members [20]. In tomato, this signature was also found 
in all the members of the RAF family except S1MAPK83, strongly 
supporting their identity as members of the RAF subfamily (Fig. 5). 
The characteristic feature of the ZIK family consists of a conserved 
signature GTPEFMAPE (L/V/M) (Y/F/L) across the members 
[20]. Sixteen MAPKKKs out of 89 members had ZIK specific 
signatures (Fig. 6). 

Chromosomal mapping and gene duplication 

The chromosomal locations and transcription directions of 
tomato MAPKK and MAPKKK genes were determined and 



demonstrated using BLASTN analysis on tomato WGS chromo- 
somes. Interestingly, five SIMAPKKs were distributed on chro- 
mosomes 3 and 12 (Fig. 7). Four of them were located on 
chromosome 3, and the other one was located on chromosome 12. 
Although the SIMAPKKK family genes were distributed over aU 
the 12 chromosomes (Fig. 7), the number in each chromosome 
diflFered, ranging from one (chromosome 5) to 18 (chromosome 7). 

Gene duplication events have an important function in the 
amplification of gene family members in tomato genome. Gene 
families can arise through the tandem amplification or segmental 
duplication of chromosomal regions [50]. In this study, no tandem 
duplicated gene pairs and segmental duplicate families were found 
in the SIMAPKK gene. In the SIMAPKKK gene family, we found 



Table 3. The numbers of SIMAPK, SIMAPKK, and SIMAPKKK in Arabidopsis, rice, tomato, and maize. 



Species MARK MAPKK iviai'kkk j^oI number of MAPKKKs 

MEKK ZIK RAF 

Arabidopsis 20 10 21 11 48 80 

tomato 16 5 33 16 40 89 

rice 15 8 22 10 43 75 

maize 20 14 22 6 46 74 

doi:l 0.1 371 /journal.pone.01 03032.t003 
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Figure 8. Expression profiles of SIMAPKK family genes in tomato using qRT-PCR analysis. A: transcript levels of all 5 SIMAPKK in different 
tomato organs including root, stem, leaf, flower buds, and fruit. B: transcript level change of all 5 SIMAPKK genes in tomato seedlings exposed to heat 
(H), Cold (C), drought (D), salt (S) stresses when compared to control treatment. C: transcript levels of all five SliVlAPKK genes in tomato seedlings 
exposed to Pseudomonas syringes. Data represent the means and standard errors of three independent biological samples. Relative expression levels 
were normalized relative to a reference gene SlUbiS (accession number X58253). Asterisks indicate significant differences as determined by Student's 
t-test (*P, 0.05; *»P, 0.01 ). 
doi:10.1371/journal.pone.0103032.g008 



two clusters (Fig. 7, red box) and 23 segmental duplications. 
Synteny analysis further confirmed the segmental duplications 
among the SIMAPKKK genes. Most of the pairs of segmental 



duplicates were distributed on different chromosomes. Three pairs 
were also distributed on the same chromosome. Even though some 
gene pairs shared high similarity in sequence, such as SIMAPKK2/ 
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Figure 9. Expression patterns of MEKK subfamily genes in different organs and under abiotic and biotic stress treatment in tomato 
by qRT-PCR analysis. For other details, see Figure 8. 
doi:1 0.1 371/journal.pone.01 03032.g009 
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Figure 10. Expression patterns of RAF subfamily genes in different organs and under abiotic and biotic stress treatment in tomato 

by qRT-PCR analysis. For other details, see Figure 8. 

doi:10.1371/journal.pone.0103032.g010 



SIMAPKK4, SIMAPKKK1/SIMAPKKK83, SIMAPKKKIO/SIM 
APKKK56, SIMAPKKK57/SIMAPKKK85, and SIMAPKKK7/ 
SIMAPKKK30/SIMAPKKK39, we found no clear evidence of 
segmental duplication among them (Fig. SI). Thus, the expansion 
of the MAPKKK gene families in tomato might be a consequence 
of whole genome or chromosomal segment duplications. The 
tandem duplications may have slight affection. 



Cis-elements in promoter regions of SIMAPKK and 
SIMAPKKK genes 

Genes responsive to multiple stimuli are closely correlated with 
cis-regulatory elements in their promoter regions [43] . To further 
understand transcriptional regulation and the potential function of 
SlMAPKKs and SlMAPKKKs, cis-elements in their promoter 
sequences were predicted. Many cis-elements involved in plant 
growth and resistance were found in the 2 kb upstream region of 
tomato SIMAPKK and SIMAPKKK genes using the PLACE 
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Figure 11. Expression patterns of ZIK subfamily genes in different organs and under abiotic and biotic stress treatment in tomato 
by qRT-PCR analysis. For other details, see Figure 8 (Left corresponding to upper part; middle corresponding to middle part; Right corresponding 
to lower part). 

doi:10.1371/journal.pone.0103032.g011 



database (http://www.dna.afFrc.go.jp/PLACE/) (Tables S2 and 
S3). One salt-stress (S000453), one heat-stress (S00030), one cold- 
stress (S000407), one wound-stress (S000457), three drought- 



stresses (S000176, S000407, and S000409), and disease resistance 
(S000024)-related cis-elements were all found in the promoter 
regions of both SlMAPKKs and SlMAPKKKs. Moreover, auxin 
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Figure 12. Heat map shows the real-time quantitative RT-PCR (qRT-PCR) analysis results of S/MAPKK genes with exogenous lAA 
(left) and SA (right) treatments. 
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(S000273), GA (S000259), ABA (S000292), and ET (S000037) 
signaling transduction-related cis-elements were found in most of 
die detected sequences (Tables S2 and S3). These results suggest 
that most SlMAPKKs and SlMAPKKKs may participate in 
tomato development and in response to stressful environments. 

Evolutionary patterns and divergence of MAPKK and 
MAPKKK genes in plants 

To further investigate the evolutionary relationships of MAPKK 
and MAPKKK proteins, we compared these two gene families 
between two monocotyledonous (maize and rice) and two 
dicotyledonous plants {Arabidopsis and tomato). Unrooted phylo- 
genetic trees were constructed based on 37 MAPKK and 318 
MAPKKK sequences (Figs. S2 and S3). The numbers of 
MAPKKs and MAPKKKs in different species are indicated in 
Table 3. The AtMAPKKs and AtMAPKKKs were downloaded 
from TAIR. The OsMAPKKs and OsMAPKKKs were down- 
loaded from KOME. The ZmMAPKKKs were downloaded from 
NCBI. Given that the ZmlvIAPKKs have not been reported 
systematically, we identified and analyzed MAPKK family genes 
in maize using the same method for identifying tomato 
SlMAPKKs (Table S4, Figs. S4 and S5). 

Similar to previous studies [3,4], 37 MAPKKs were divided into 
four groups (groups A-D) (Fig. S2). However, no maize MAPKK 
genes belonged to groups B and C (Figs. S2 and S4), thereby 
implying that groups B and C MAPKK proteins might be lost in 
the maize genome after species differentiation. Although group D 
of MAPKKs contained genes from all four species, more than half 
(12 out of 20) came from maize (Figs. S2 and S4). This result 
indicates that the members of group D may have more important 
functions in maize than in other species, and gene expansion in 
this group could lead to a large maize MAPKK gene family. 

All the 3 1 8 MAPKKKs from four different species formed three 
subfamilies, namely, MEKK, RAF, and ZIK (Fig. S3). Most of the 
groups or subfamilies contained members from all four species 
(Table 3, Fig. S3), implying that the genes within these classes were 
derived from a common ancestor. However, the number of 
SlMAPKKKs in the MEKK and ZIK subfamilies was larger than 
that in the same subfamilies in the other three species (Table 3), 
which indicates that gene expansion of tomato MAPKKKs mainly 
occurred in these two subfamilies. 



Expression profile of SIMAPKK and SIMAPKKK genes in 
different tissues or organs 

Expression analysis of SlMAPKKs revealed that most of these 
genes were constitutively expressed because their expression could 
be detected in most selected organs (Fig. 8). Relatively higher 
expression levels for SlMAPKKl and SIMAPKK4 than those for 
other SlMAPKKs were found in tomato organs. The expression 
pattern of these two genes is different from that of their orthologs 
in Aj'ahidopsis (MPSS database) and soybean [5] . The expression 
levels changed markedly among different organs/tissues. For 
example, SlMAPKKl and SIMAPKK4 had the highest expres- 
sion values in the root and stem, respectively, whereas 
SIMAPKK2 had a relatively high expression level in the flower. 

No specific primers could be found to distinguish corresponding 
genes from each other because of the high similarity in nucleotide 
sequence between SIMAPKKK46 and SIMAPKKK4 and be- 
tween SIMAPKKK50 and SIMAPKKK52. Thus, the expression 
patterns of all the tomato MAPKKK genes, except above four 
SIMAPKKK genes were analyzed. The expression of most 
SIMAPKKK genes was detected in all the selected organs 
(Figs. 9, 10, and 11). However, some genes were highly expressed 
in one or several specific organs. Twelve SlMAPKKKs from three 
subfamilies had higher expression levels in the root than that in 
other organs (Figs. 9, 10, and 11). Meanwhile, 13 SlMAPKKKs 
belonging to three subfamilies showed markedly higher expression 
levels in tomato stem than that in other organs (Figs. 10 and 1 1). 
Only three SlMAPKKKs {SIMAPKKK33, SIMAPKKK34, and 
SIMAPKKK3 5) were expressed with high abundance in fruits, 
while they were clustered at the end of chromosome 4 (Fig. 10). 

Expression patterns under various stress conditions 

MAPKK genes in plants are involved in response to various 
biotic and abiotic stresses. In Arabidopsis, MAPKK2 has an 
important function in the cold and salt signaling transduction 
pathway [13,14]. In maize, ZmMAPKK4 is essential for salt and 
cold tolerance because the overexpression of ZmMKK4 in 
Arabidopsis leads to insensitivity to salt and cold treatment [51]. 
Both AtMAPKKl and AIMAPKK2 are associated with plant 
innate immunity [13-15]. In addition, AtMAPKK3 has a function 
in pathogen resistance [17,18]. In tomato, SIMAPKK2 acts with 
SIMPK2, thereby directly contributing to resistance to Xantho- 
monas campestris pv. vesicatoria [52]. In this study, the relative 
mRNA level of five SUVIAPKKs changed significantly under heat, 
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Figure 13. Expression profiles of MEKK subfamily genes witKi exogenous lAA (left) and SA (right) treatments. 
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cold, drought, and salt stresses (Fig. 8). AH five SIMAPKK genes 
were upregialated by heat, cold, and drought treatment, whereas 
SIMAPKK2 and SIMAPKK5 were significandy upregulated by 
salt stress (Fig. 8). By contrast, the expression level of SIMAPKK4 
was also dramatically upregulated (more than fourfold) after P. 
syringae treatment (Fig. 8), which indicates that SIMAPKK4 may 



also have an important function in the defense response to tomato 
pathogens. 

The expression pattern of tomato MAPKKK genes under 
abiotic and biotic treatment was also analyzed in detail. Most of 
the SIMAPKKK genes were significantly upregulated by all four 
abiotic treatments (Figs. 9, 10, and 11), namely, heat, cold, 
drought, and sah. The relative mRNA levels oi SIMAPKKK51 , 
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Figure 14. Expression profiles of RAF subfamily genes witKi exogenous lAA (left) and SA (right) treatments. 

doi:10.1371/journal.pone.0103032.g014 



SIMAPKKK53, and SIMAPKKK55 were upregxilated by more 
than 100-fold after heat or drought treatment (Fig. 9). Meanwhile, 
13 MAPKKK genes showed a more than 10-fold change in 
expression levels under salt treatment (Figs. 9, 10, and 1 1). These 
data indicate that most SIMAPKKK genes were involved in the 
regulation of various abiotic stress signaling transduction path- 



ways. After P. syringae treatment, some SIMAPKKK genes were 
also remarkably upregulated. Especially for SIMAPKKK45 , 
SIMAPKKK48, and SIMAPKKK49, the relative mRNA levels 
were increased by more than 10-fold after treatment (Figs. 9, 10, 
and 1 1), indicating that these SIMAPKKK genes may have special 
functions in plant pathogen resistance. 
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Figure 15. Expression profiles of ZIK subfamily genes with exogenous lAA (left) and SA (right) treatments. 
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The expression patterns of MAPKKK duplicated gene pairs 
were also investigated. Only three pairs {SIMAPKKK12 and 
SIMAPKKK76, SIMAPKKK6 and SIMAPKKK71, and 
SIMAPKKK4 and SIMAPKKK8) and one paralogous pair, 
including SlMAPKKKl, SIMAPKKK83, and SIMAPKKK84, 
shared similar expression patterns in nearly all stress conditions, 
whereas other paralogs were difTerent. Although the duplicated 
SIMAPKKK genes had high similarity in amino acid sequences, 
they may have evolved a different expression pattern and function. 
Similar observations on the plant MAPKKK family have also 
been reported in maize [21] and soybean [5]. 

Expression profiles under lAA and SA treatment 

MAPK cascades interact with or participate in die signal 
transduction of many plant hormones, such as auxin, ethylene, 
abscisic acid, SA, andjasmonic acid (JA) [32,53-55]. In this study, 
the expression patterns of tomato MAPKK and MAPKKK genes 
after exogenous lAA and SA treatment were analyzed in detail. AU 
the SlMAPKKs except SIMAPKK3 and SIMAPKK5 were 
upregulated in response to lAA and SA treatment (Fig. 12). In 
Arabidopsis, MKK7 negatively regulates polar auxin transport and 
subsequently affects plant architecture [56]. In tobacco, the 
overexpression of SIPK enhances ozone-induced ethylene forma- 



tion and blocks ozone-induced SA accumulation [57]. JA can 
activate the MAPK cascade MKK3-MAPK6 and negatively 
regulate ATMYC2/JIN1 expression, thereby controlling Arabi- 
dopsis root growth [18]. All these results provide clear evidence of 
the involvement of the MAPKK pathway in response to hormone 
treatments. 

The mRNA levels of most SlMAPKKKs varied considerably at 
different time periods after exogenous lAA and SA treatment 
(Figs. 13, 14, and 15). Almost half of the MEKK subfamily 
members were markedly upregulated by lAA and SA treatment, 
whereas others showed nearly no change with a relatively low 
expression level (Fig. 13). The RAF subfamily genes shared similar 
expression patterns with the MEKK subfamily (Fig. 14). However, 
all the ZIK subfamily members, except SIMAPKKK5, 
SIMAPK42, and SIMAPK49, had a remarkable response after 
lAA and SA treatment (Fig. 15). These data imply that most 
SIMAPKKK genes may be involved in plant hormone signaling 
during plant development and defense response. In rice, a MAPK 
gene, BWMKl , responds to other plant hormones, such as JA, SA, 
and benzothiadiazole [58] . Using the Aj'ahidopsis leaf protoplast 
transient expression system, Kovtun et al. proved that an oxidative 
stress MAPK cascade can negatively regulate early auxin response 
[59]. However, evidence of the involvement of MAPKKK in 
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hormonal responses is limited. The patterns of interaction between 
the MAPK cascades and hormone signaling pathway need further 
investigation [60]. 

Conclusion 

MAPK cascade family genes should be systematically analyzed 
to understand their functions in plant development and stress 
response. In this study, we present the genome-wide identification 
and analysis of the MAPKK and MAPKKK gene families in 
tomato. Five SlMAPKKs and 89 SlMAPKKKs were identified 
from the available tomato genome. Based on structural charac- 
teristics and a comparison of phylogenetic relationships among 
tomato, Arabidopsis, maize, and rice, all these MAPKK and 
MAPKKK genes were divided into four and three groups, 
respectively. Our results suggest that chromosomal segment 
duplications may be the main factors for the expansion of the 
MAPKKK gene family in tomato. Although nearly all the 
MAPKK and MAPKKK family genes were expressed in all the 
detected organs, some genes were highly expressed in one or 
several specific organs. The expression of most SlMAPKKs and 
SlMAPKKKs could be induced by both abiotic and biotic stress 
treatment. Most of the SIMAPKK and SIMAPKKK genes may 
interact with plant hormones, such as auxin and SA, during plant 
development and defense pathways. Our study could help improve 
the understanding of the complexity of the MAPKK cascade and 
guide future studies for functional analyses. The functions of 
organ-specific and stress-related genes in MAPK cascades and 
interaction with other signaling pathways in tomato are being 
characterized in our laboratory using overexpression and knock- 
down methods. 
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